NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Infinity Stream: Portable and Programmer-Friendly In-/Near-Memory Fusion

https://doi.org/10.1145/3582016.3582032

Wang, Zhengrong; Liu, Christopher; Arora, Aman; John, Lizy; Nowatzki, Tony (March 2023, ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

In-memory computing with large last-level caches is promising to dramatically alleviate data movement bottlenecks and expose massive bitline-level parallelization opportunities. However, key challenges from its unique execution model remain unsolved: automated parallelization, transparently orchestrating data transposition/alignment/broadcast for bit-serial logic, and mixing in-/near-memory computing. Most importantly, the solution should be programmer friendly and portable across platforms. Our key innovation is an execution model and intermediate representation (IR) that enables hybrid CPU-core, in-memory, and near-memory processing. Our IR is the tensor dataflow graph (tDFG), which is a unified representation of in-memory and near-memory computation. The tDFG exposes tensor-data structure information so that the hardware and runtime can automatically orchestrate data management for bitserial execution, including runtime data layout transformations. To enable microarchitecture portability, we use a two-phase, JIT-based compilation approach to dynamically lower the tDFG to in-memory commands. Our design, infinity stream, is evaluated on a cycle-accurate simulator. Across data-processing workloads with fp32, it achieves 2.6× speedup and 75% traffic reduction over a state-of-the-art near-memory computing technique, with 2.4× energy efficiency.
more » « less
Full Text Available
Infinity Stream: Enabling Transparent and Automated In-Memory Computing

https://doi.org/10.1109/LCA.2022.3203064

Wang, Zhengrong; Liu, Christopher; Nowatzki, Tony (July 2022, IEEE Computer Architecture Letters)

Full Text Available
Tsunami Early Warning From Global Navigation Satellite System Data Using Convolutional Neural Networks

https://doi.org/10.1029/2022GL099511

Rim, Donsub; Baraldi, Robert; Liu, Christopher M; LeVeque, Randall J; Terada, Kenjiro (October 2022, Geophysical Research Letters)

We investigate the potential of using Global Navigation Satellite System (GNSS) observations to directly forecast full tsunami waveforms in real time. We train convolutional neural networks to use less than 9 min of GNSS data to forecast the full tsunami waveforms over 6 hr at select locations, and obtain accurate forecasts on a test data set. Our training and test data consists of synthetic earthquakes and associated GNSS data generated for the Cascadia Subduction Zone using the MudPy software, and corresponding tsunami waveforms in Puget Sound computed using GeoClaw. We use the same suite of synthetic earthquakes and waveforms as in earlier work where tsunami waveforms were used for forecasting, and provide a comparison. We also explore varying the number of GNSS stations, their locations, and their observation durations.
more » « less
Full Text Available
Affinity Alloc: Taming Not-So Near-Data Computing

https://doi.org/10.1145/3613424.3623778

Wang, Zhengrong; Liu, Christopher; Beckmann, Nathan; Nowatzki, Tony (October 2023, ACM)

Full Text Available

Search for: All records